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Abstract 

Background: The Estrogen Receptor alpha (ERa) is the l<ey transcriptional regulator in lunninal breast cancer and is 
therefore the main target for adjuvant treatnnent of this subtype. Lunninal gene signatures are dictated by the 
transcriptional capacities of ERa, which are a direct consequence of the receptors binding preference at specific 
sites on the chromatin. The identification of ERa binding signatures on a genome-wide level has greatly enhanced 
our understanding of Estrogen Receptor biology in cell lines and tumours, but the technique has its limitations with 
respect to its applicability in limited amounts of tumour tissue. 

Results: Here, we present a refinement of the ChlP-seq procedures to enable transcription factor mapping on 
limited amounts of tissue culture cells as well as from a limited amount of tumor tissue derived from core needle 
biopsies. Our approach uses a carrier that can be removed prior to DNA amplification and sequencing. 

Conclusion: We illustrate the applicability of this refined technology by mapping the ERa genome-wide chromatin 
binding landscape in core needle biopsy material from primary breast tumours. With this, our refined technology 
permits for a high-resolution transcription factor mapping even from clinical samples. 



Background 

Breast cancer is the most common diagnosed malig- 
nancy in women, with over 1.4 milUon new cases world- 
wide annually [1]. 75% of all breast cancers are from the 
luminal subtype [2] for which tumour growth is thought 
to be dependent on the activity of the Estrogen Receptor 
alpha (ERa). This growth-dependency renders this path- 
way the main target for adjuvant endocrine treatment in 
luminal breast cancer. 

The physiological behaviour of the ERa involves bind- 
ing of the receptor to its natural ligand estradiol, after 
which the receptor associates to the chromatin, recruits 
its coregulators and alters the transcriptional activity of 
responsive genes, leading to increased cell proliferation 
and tumour growth. In endocrine treatment of breast 
cancer, the activation of the receptor can be inhibited 
through multiple ways, each of which resulting in an 
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inhibition of ERa-driven cell proliferation. Even though 
these endocrine intervention therapies greatly increase 
the disease-free survival and overall survival of breast 
cancer patients [3], resistance to treatment is commonly 
observed. This resistance can be mediated through a 
multitude of different mechanisms, including differential 
expression of kinases [4,5], coregulators [3,6] and trans- 
membrane receptors [7,8]. 

Importantly, it is becoming apparent that mechanisms 
of resistance may result from intrinsic effects on ERa/ 
chromatin interactions, as shown both in cell lines [9,10] 
and in tumour samples [11]. Since the ERa/chromatin 
interactome directly determines the estradiol-mediated 
effects on gene expression [12], distinct binding patterns 
may be a key-defining factor for deviating gene expres- 
sion patterns as well [13]. A major hurdle to assay such 
transcription factor/chromatin interactions, is the fre- 
quently very limited starting material due to the minimal 
amount of tissue that is obtained during diagnostic 
work-up. 

Recently approaches have been described to map his- 
tone modifications in a limited amount of starting 
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material [14], but no such protocol has been described 
for transcription factors. Similarly, new methods for 
amplification of limited material have been described 
[15], but these assume that sufficient DNA was recov- 
ered from the ChIP to enable amplification in the first 
place. One of the major technical limitations has been 
enriching sufficient DNA (via ChIP) from limited 
starting material. This problem has been addressed using 
'carrier chromatin' [16], a strategy that is not amenable 
to global sequencing approaches. Here, we describe a 
procedure to enhance transcription factor ChlP-seq by 
incorporating a modified carrier method to enable ChIP- 
seq from limited starting material. We apply this method 
to map the ERa/chromatin interactome in as few as 
10,000 tissue culture cells and apply this technology to 
map the ERa chromatin binding landscape in core nee- 
dle biopsy samples from breast cancer patients. 

Results and discussion 

Carrier selection for small cell number ChlPs 

To optimize the Chromatin Immunoprecipitation (ChIP) 
procedure from a limited amount of cells, we utilized 
the well-characterized (ERa-positive and Progesterone 
Receptor-positive) breast cancer cell line MCF7. MCF7 
cells have been widely applied for ChlP-seq applications, 
and multiple datasets are publicly available to be used as 



a reference. Proliferating MCF7 cells from a sub-confluent 
10 cm dish were harvested, after which the total amount 
of cells were counted. The lysate from 10,000 cells was 
used for ChIP. Initially a focused ERa ChlP-real time PCR 
approach was taken, using the well-studied pS2 promoter 
region. ERa ChIP was performed using standard methods 
[17], but in the absence of carrier, no enrichment from the 
Chip sample was observed (Figure lA). In addition, two 
independent carriers were tested: glycogen and mRNA/ 
histones. The rationale was to test carriers that were either 
inert (glycogen; not having any effect on the chromatin 
nor sequencing procedure) or had a composition that 
could be removed from the sample (mRNA and histones; 
to be removed by RNAse A and Proteinase K). The car- 
riers were added to the lysate mixture before antibody- 
conjugate addition. Glycogen proved to have a small, but 
not significant increase in ChIP efficiency, when using the 
pS2 promoter region. The addition of recombinant his- 
tones and random human mRNA led to a significant in- 
crease of ERa signal at the pS2 promoter (Figure lA). 
This enhanced enrichment was achieved both by an in- 
creased Chip efficiency for specific signal as well as a de- 
crease of the nonspecific background binding (Additional 
file 1: Figure SI). Combining the two carrier procedures 
resulted in a decreased signal as compared to only adding 
mRNA/histones. 
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Figure 1 Carrier-optimized ChlP-seq in IVICF7 cells. A. ChlP-qPCR on pS2^FFl promoter for ERa with differential carrier conditions (none, 
glycogen, mRNA/histones, glycogen/mRNA/histones). Data were normalized over negative control region and input. Bars show standard 
deviation from triplicate measurements. B. Genome browser snapshot of ERa ChlP-seq, depicting the pS2^FFl promoter and enhancer region for 
10,000 cells without (top) or with mRNA/histones carrier (middle). Data was compared to a saturated ERa ChlP-seq from 20''10^ cells (bottom). 
Genomic coordinates and tag count are indicated. C. Heatmap visualization of ERa ChlP-seq data, depicting all binding events centred on the 
peak region within a 5 kb window around the peak. All peaks for the 20^10^ cells ChlP-seq condition were ranked on intensity, and the data from 
both 10,000 cells-conditions were plotted in identical order. 
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Even though the increased enrichment at the pS2 pro- 
moter was indicative of increased ChIP efficiency, this 
cannot be generalized towards all ERa/chromatin inter- 
actions. Therefore, the ERa ChlP-samples were further 
processed and prepared for sequencing. Only the two 
extreme conditions (no carrier versus mRNA/Histones 
carrier) were sequenced and compared to a publicly avail- 
able dataset of proliferating MCF7 cells, performed on 
-20 million cells [10]. The carrier mRNA and histone 
were degraded prior to amplification of enriched ChIP 
DNA. An example of difference in binding capacity in the 
presence and absence of carrier mRNA/histones is shown 
in Figure IB and all binding events are illustrated as a 
heatmap in Figure IC. Importantly, we ranked all ERa 
binding peaks observed in the sample with -20 million 
cells and found that -60% of these were recapitulated 
when using the mRNA and histones as carrier from 
10,000 cells. This is a substantial improvement compared 
to ERa ChlP-seq without carrier, where only signal was 
detected at -20% of peaks. Heatmap visualisations and 
quantifications indicated that peak calling on 10,000 cells 
with RNA/histones carrier provided comparable signal 
intensity distributions as the saturated ChIP. This in 
contrast to the 10,000 cells analysis without carriers 
(Additional file 2: Figure S2) where mainly background 
was found. Grouping the ChIP distributions on the basis 
of signal intensities showed that peaks with high- intensity 
signal under the RNA/histones carrier ChIP conditions, 
consistently provided high signal distributions for the sat- 
urated ChIP, and vice versa (Additional file 3: Figure S3). 
No subsets of false-positive and false-negative regions 
were observed (Additional file 3: Figure S3). Interestingly, 
the dynamic range of signal intensity as detected in the 
saturated ChIP was dampened under the carrier condi- 
tions (Additional file 3: Figure S3A). Cumulatively, these 
data show that the addition of carrier chromatin facilitates 
transcription factor ChlP-seq from 10,000 cells, without 
negatively effecting the amplification or global fidelity of 
the process. 

In order to confirm the biological validity of the ERa 
binding events mapped using mRNA/histone as carrier 
for ChlP-seq from 10,000 cells, we integrated the 
binding information with estrogen (E2) -mediated gene 
expression profiles. For all E2-regulated genes, we deter- 
mined whether our altered ChIP protocol enabled the 
identification of proximal ERa as compared to the satu- 
rated Chip (Figure 2). We used a publically available 
gene expression microarray dataset, from MCF7 cells 
treated for 6 hours with E2 [13] and found 1,821 tran- 
scripts up-regulated and 1,812 down-regulated by E2. A 
window of 20 kb around the transcription start-site was 
applied to determine proximal ERa binding (which has 
previously been shown to be the optimal window be- 
tween ERa binding events and promoters of target 



genes [12]). For the saturated ChlP-seq, approximately 
60% of all E2 up- and downregulated genes had a prox- 
imal ERa binding event (Figure 2A), which is consistent 
with our previous report [13]. Using the ERa binding 
events discovered from 10,000 cells with the mRNA/his- 
tones as carrier, we found that 38% of the E2 up- 
regulated genes had an adjacent ERa binding event 
(Figure 2A), a figure under-represented because of the 
fewer ERa binding events in this category. We also ranked 
the strongest gene-proximal ERa binding events from the 
mRNA/histone carrier-sample based on tag count, resulting 
in binding at well-described ERa target genes, including 
TFFl, RARA, GREBl and XBPl (Figure 2B). 

Carrier-ChIP from core needle biopsies of human breast 
tumours 

Given the findings that mRNA/histone carrier appeared 
to assist ChlP-seq from limited cell numbers, we applied 
this approach to clinically relevant samples. We chose to 
study core needle biopsy samples from breast cancer pa- 
tients, since they represent clinically available, but limited 
material that would be appropriate for our approach. 
These samples were all positive for Estrogen Receptor, as 
determined by immunohistochemistry (Additional file 4: 
Table S2). We tested the two carrier conditions independ- 
ently; 1. glycogen 2. mRNA in combination with recombin- 
ant Histone 2B. ERa ChlP-seq was performed in all core 
needle biopsies. When using glycogen as a carrier, a small 
level of enrichment was observed from the biopsy samples, 
as exemplified at multiple binding sites (Figure 3A). Im- 
portantly, in the presence of mRNA/histone carriers, the 
enrichment at both the example regions and globally, was 
significantly increased. This resulted in 669 ERa peaks in 
the presence of glycogen and 3,366 peaks in the presence 
of mRNA/histone carrier for two biopsies with directly 
comparable clinical parameters (Additional file 4: Table 
S2). Importantly, in the presence of mRNA/histone carrier, 
ERa ChlP-seq signal was comparable to the level observed 
in the cell line (Figure 3A), suggesting that the presence of 
the carrier enables increased ERa ChlP-seq signal, even 
when applied to limited samples obtained from core needle 
biopsies. 

ERa binding events are rarely found at promoters (5%) 
and the majority of binding sites is normally enriched at 
introns and distal intergenic regions [18]. We analysed 
the binding site localizations and found that the mRNA/ 
histones assisted ChlP-seq showed comparable genomic 
distributions compared to cell line data (Figure 3B). 
Motif analyses were performed for the mRNA/histone 
assisted ChIP condition (Figure 3C). As expected, ER 
DNA binding motifs were the most strongly enriched 
motifs. 

Since ERa is the driving transcription factor in luminal 
breast cancer, we interrogated the genes proximal to the 
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Figure 2 Proximal Estradiol-responsive genes and carrier ERa ChlP-seq enrichment. A. Estradiol up- (white) and down-regulated (black) 
genes were interrogated for proximal ERa binding events under saturated conditions with 20*10^ cells (right), as well as ChlP-seq on 10,000 cells 
without (left) and with mRNA/histones carrier (middle). The percentage of E2-reponsive genes with a proximal ERa binding site is shown. B. ERa 
binding events with mRNA/histones carriers were ranked and proximal E2-upregulated genes were determined for the top 10 highest intensity 
binding sites. Heatmap indicates tag count. 
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ERa binding events, and determined whether these were 
previously identified as basal or luminal gene signatures 
in independent cohorts of breast cancer patients [2] 
(Figure 3D). For the glycogen carrier condition, none of 
these hallmark genes were found to have a proximal 
ERa binding event for any of the tumour samples tested 
(data not shown). For the mRNA/histone-assisted condi- 
tion, an enrichment of luminal genes was found over 
basal-signature genes (Figure 3D). Expression of these 
signature genes with proximal ERa binding events was 
found to correlate with a favourable outcome after endo- 
crine treatment in a meta-analysis of breast cancer patients 
with ERa-positive disease [19] (Figure 3E), indicating that 
our carrier ChlP-seq protocol enabled us to identify direct 
ERa target genes with clinical implications from core nee- 
dle biopsy samples. 

Conclusion 

The genome-wide assessment of ERa/chromatin interac- 
tions has been performed in tissue culture cell lines [18] 
and primary breast cancer tissue [11]. Even though 
ChlP-seq is a very powerful technology to delineate tran- 
scription factor-chromatin interactions and to identify 
specific locations of histone marks on a genome-wide 
level, the technology does have its intrinsic limitations. 
The availability of a specific antibody and applicable for 
immunoprecipitation from formaldehyde-fixed material 
are major limitations. In addition, there are potential is- 
sues related to epitope masking. Another major limita- 
tion of ChlP-seq analyses is the need for a certain 
amount of starting material. Reports have described 
ChlP-seq analyses on histone marks on limited starting 



material [14], but these represent relatively stable cova- 
lent interactions, as compared to the dynamic and un- 
stable interactions between transcription factors and 
chromatin. In directed ChIP analyses, the benefit of 
carrier during the immunoprecipitation procedure has 
proven to be substantial [16]. In that method, non- 
relevant chromatin (e.g. drosophila DNA), is added as a 
carrier during the procedure, in order to enhance the 
enrichment from the limited but relevant cellular mater- 
ial. While this approach can be applied for qPCR-ChIP, it 
is not readily applicable for ChlP-seq, which will sequence 
all DNA, regardless of the species, resulting in substantial 
dilution of the relevant enriched DNA. How the addition 
of the carriers exactly enhances the ChIP procedure on a 
molecular level still needs to be resolved. Most likely, the 
bulky material of the carrier helps to retain the small 
amounts of relevant chromatin throughout the isolation 
procedure. In addition, it might prove an additional bene- 
fit of blocking nonspecific antibody/protein interactions, 
thereby increasing substrate specificity of the antibody in 
the reaction to reduce background signal. 

Here, we introduce a novel mode of carrier-assisted 
Chromatin Immunoprecipitation that can be applied to 
transcription factor mapping by ChlP-seq. The enrich- 
ment from limited cellular material is enhanced by the 
addition of mRNA and recombinant histone protein, 
which function as a carrier. Importantly, both compo- 
nents of the carrier can subsequently be removed and 
provide an approach that is cost effective, easily applied 
and applicable in next-generation sequencing reactions. 

We applied this approach to map ERa in 10,000 asyn- 
chronous MCF7 cells, which results in an improved 
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Figure 3 Carrier CiilP-seq analyses on breast tumour core needle biopsies. A. Genome browser snapshots of ERa ChlP-seq on MCF7 cells 
(black) and breast cancer core needle biopsy samples with glycogen (blue) or mRNA/histones (red) as carriers. Tag count is shown on the Y-axis. 
Genomic locations are indicated. B. Genomic distributions of ERa binding events in MCF7 cells (top) and breast tumour biopsy samples with 
mRNA/histones as carrier (bottom). C. Motif analysis of ERa binding events on a core needle biopsy sample, with mRNA/histones as carrier. 
Indicated are the top 3 enriched motifs and p-values. D. Luminal enrichment for ERa carrier ChlP-seq genes. Genes, identified as 'luminal' or 
'basal' signatures by Perou et al. [2] were analysed for proximal ERa binding events. Enrichment of 'luminal' over 'basal' genes was found. E. 
Genes, identified in D., were tested for correlation with recurrence-free survival, using Estrogen Receptor positive samples from a meta-analysis of 
breast cancer patients [19]. Expression of identified genes correlated with a favourable outcome after endocrine treatment. 



DNA recovery, when compared to no carrier. Although 
this recapitulated -60% of the signal observed when 
performing ERa ChlP-seq in saturating numbers of cells, 
these possessed all the expected characteristics of robust 
ERa binding events. The dynamic range of the ChIP sig- 
nal intensity was dampened under carrier conditions, 
possibly implying that weaker ChIP enrichment regions 
are enhanced in our method. Furthermore, as a proof-of- 
principle, we applied our carrier-assisted ChIP protocol 
to limited material derived from core needle biopsy sam- 
ples, isolated from breast cancer patients. Again, the 
presence of the mRNA/histone carrier resulted in 



increased ERa ChlP-seq signal, allowing for discovery of 
clinically and biologically relevant ERa regulatory do- 
mains from these samples. Several approaches have been 
recently described [15,20] that enhance amplification 
from limited DNA. Although these are powerful 
methods, they assume that there is sufficient DNA 
enriched by ChIP, but currently, methods do not exist 
that enable the ChIP enrichment from limited material 
for integration with these novel amplification approaches. 
We believe that our modified carrier ChlP-method im- 
proves the ability for purifying transcription factor- 
associated DNA from limited material, such as tumour 
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samples. The potential exists for coupling our modified 
Chip approach with the novel amplification methods for 
greater ChlP-seq fidelity and signal from primary mater- 
ial, potentially permitting high-resolution transcription 
factor mapping even from clinical samples. 

Methods 

Chromatin Immunoprecipitations 

Chip was performed as described before [17], but 
with a number of modifications. During the Immuno- 
precipitation, 100 (ig/ml glycogen (Invitrogen) or 20 [igl 
ml recombinant histone 2B (M2505S; New England 
Biolabs) and 1 (ig/ml human mRNA (Invitrogen) was 
added as carriers for the ChlP. During the entire pro- 
cedure, non-sticky eppendorf tubes were used (13-698- 
794; Fisher Scientific). 5 \ig antibody used was raised 
against ERa (SC-543; Santa Cruz), conjugated to 50 [A 
protein A Dynabeads (Invitrogen). After immunoprecip- 
itation, ten RIPA washes were performed (samples 
transferred to a fresh eppendorf during the last wash), 
followed by one TBS wash and reverse crosslinking, as 
described [17]. 

Solexa sequencing 

Chip DNA was amplified as described [17], using 18 
PCR amplification cycles. Sequences generated by the 
lUumina GAIIx genome analyzer (using 36-bp reads) 
were aligned against NCBI Build 36.3 of the human gen- 
ome using MAQ (http://maq.sourceforge.net/) with de- 
fault parameters. Peaks were called using model-based 
analysis for ChlP-Seq (MACS) [21] run using default pa- 
rameters. Read counts and duplication rates for all cell 
line and tumor samples are shown in Additional file 5: 
Table SI. The number of clonal reads in the carrier sam- 
ple was higher and may be a result of increased duplica- 
tion reads of remaining non-ligated adapters. 

Motif analysis, heatmaps and genomic distributions of 
binding events 

ChlP-seq data snapshots were generated using the Inte- 
grative Genome Viewer IGV 2.1 (www.broadinstitute. 
org/igv/). Motif analyses were performed through the 
Cistrome (cistrome.org), applying the SeqPos motif tool 
[22]. The genomic distributions of binding sites were 
analysed using the c/5-regulatory element annotation 
system (CEAS) [23]. The genes closest to the binding 
site on both strands were analysed. If the binding region 
is within a gene, CEAS software indicates whether it is 
in a 5'UTR, a 3'UTR, a coding exon, or an intron. Pro- 
moter is defined as 3 kb upstream from RefSeq 5' start. 
If a binding site is >3 kb away from the RefSeq transcrip- 
tion start site, it is considered distal intergenic. 



Tumour tissue handling 

Tissue for ChlP-seq analyses was from four newly diag- 
nosed breast cancer patients who were treated at the 
Netherlands Cancer Institute. Three specimens were col- 
lected under a prospective pre-operative endocrine therapy 
study approved by the institutional ethics committee. 
These patients provided written informed consent. Also 
one sample, taken from anonymous left over material, 
which would be discarded otherwise, was used. Since we 
are using anonymous, coded leftover material which is not 
traced back to the patients and does therefore not interfere 
with care and/ or prognosis, no ethical approval is required 
according to Dutch legislation [the Medical Research In- 
volving Human Subjects Act; http://www.ccmo-online.nl/ 
main.asp?pid=10&sid=30&ssid=51) and our institutional 
medical ethical review board. This conforms as described 
[24]. Three biopsies (14-gauge, 15 [im thick) were tal<en 
from each tumour, two of which where frozen with liquid 
nitrogen and one was formalin-fixed for direct clinical as- 
sessment. Tumour cell percentage was determined using 
Hematoxylin and Eosin (H&E), and tissue was only used 
for ChlP-seq analysis when tumour cell percentage was 
higher then 50%. All samples stained positive for Estrogen 
Receptor. Further information on the tumor samples and 
patient characteristics can be found in Additional file 4: 
Table S2. 

Gene expression analyses and survival data 

Genes with an ERa binding site within 20 kb from the 
transcription start site were interrogated. Estradiol- 
responsive genes were used from a publically available 
database [13]. Luminal and basal genes were previously 
identified [2]. For survival analyses, a publically available 
meta-analysis was used. ERa positive-tumours were se- 
lected from breast cancer patients who received adjuvant 
endocrine treatment [19]. 

Data deposition 

All genomic data is deposited at ArrayExpress under the 
accession number E-MTAB-1534. 

Additional files 



Additional file 1: Figure 51. Carrier ChlP QPCR er^richmer^t for the pS2 
promoter and negative control region. ChlP enrichment was calculated 
as percentage over input, both for the pS2-positive control (left panel) 
and for the negative control region (right panel). The RNA/histones 
carrier increases signal as the pS2 positive control region, while it 
diminishes signal at the negative control site. 

Additional file 2: Figure S2. Heatmap visualizations and signal 
quantifications for the three ChlP conditions. MACS peak caller was 
applied for sequencing data from samples without carrier (A), with the 
RNA/histones carriers (B) and the saturated ChlP (C). For each peak 
calling dataset, the corresponding genomic locations were tested for all 
three sequencing runs, as visualized in heatmaps (arrow head indicated 
centre of the peak, scale = 5 kb) and quantified in the 2D graphs. 
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Additional file 3: Figure S3. Peak subgroups and correlations between 
carriers/saturated ChIP samples. MACS peak caller was applied for 
sequencing data from samples with the RNA/histones carriers (A) and the 
saturated ChIP (B). Peaks were subgrouped in 'high' (I), 'medium' (II) and 
'low' (III), and raw signal intensity for the RNA/histones carriers and the 
saturated ChIP sequencing runs was analyzed. For each of the subsets, 
the corresponding genomic locations were visualized in heatmaps (arrow 
head indicated centre of the peak, scale = 5 kb) and quantified in the 2D 
graphs. 

Additional file 4: Table S2. Tumor characteristics, hormonal status and 
menopausal state. 

Additional file 5: Table SI. Sequencing reads and duplication rate. 
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